Safety Embedded Differential Dynamic Programming Using Discrete Barrier States
نویسندگان
چکیده
Certified safe control is a growing challenge in robotics, especially when performance and safety objectives must be concurrently achieved. In this work, we extend the barrier state (BaS) concept, recently proposed for stabilization of continuous time systems, to embedded trajectory optimization discrete systems using states (DBaS). The constructed DBaS into model safety-critical system integrating system's dynamics objectives. Thereby, policy directly supplied by information through state. This allows us employ with differential dynamic programming (DDP) plan execute optimal trajectories. algorithm leveraged on various planning problems including wheeled robot navigation randomized complex environments quadrotor safely perform reaching tracking tasks. DBaS-based DDP (DBaS-DDP) shown consistently outperform penalty methods commonly used approximate constrained as well CBF-based filters.
منابع مشابه
Parallel discrete differential dynamic programming for multireservoir operation
The curse of dimensionality and computational time cost are a great challenge to operation of large-scale hydropower systems (LSHSs) in China because computer memory and computational time increase exponentially with increasing number of reservoirs. Discrete differential dynamic programming (DDDP) is one of the most classical algorithms for alleviating the dimensionality problem for operation o...
متن کاملDiscrete Dynamic Programming
We consider a decision problem taking place over time. In each time period, the single player can take an action by α ∈ A , an action space. All information relevant to the future is incorporated in a state variable y Y ∈ , the state space. The dynamics of y are determined by a transition probability π α ( ’ , ) y y . We define the set of states reachable with some probability under some circum...
متن کاملProbabilistic Differential Dynamic Programming
We present a data-driven, probabilistic trajectory optimization framework for systems with unknown dynamics, called Probabilistic Differential Dynamic Programming (PDDP). PDDP takes into account uncertainty explicitly for dynamics models using Gaussian processes (GPs). Based on the second-order local approximation of the value function, PDDP performs Dynamic Programming around a nominal traject...
متن کاملReceding Horizon Differential Dynamic Programming
The control of high-dimensional, continuous, non-linear dynamical systems is a key problem in reinforcement learning and control. Local, trajectory-based methods, using techniques such as Differential Dynamic Programming (DDP), are not directly subject to the curse of dimensionality, but generate only local controllers. In this paper,we introduce Receding Horizon DDP (RH-DDP), an extension to t...
متن کاملReceding Horizon Differential Dynamic Programming
The control of high-dimensional, continuous, non-linear dynamical systems is a key problem in reinforcement learning and control. Local, trajectory-based methods, using techniques such as Differential Dynamic Programming (DDP), are not directly subject to the curse of dimensionality, but generate only local controllers. In this paper,we introduce Receding Horizon DDP (RH-DDP), an extension to t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2022
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2022.3143301